Weka4WS: A WSRF-Enabled Weka Toolkit for Distributed Data Mining on Grids
نویسندگان
چکیده
This paper presents Weka4WS, a framework that extends the Weka toolkit for supporting distributed data mining on Grid environments. Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for accessing remote data mining algorithms and managing distributed computations. The Weka4WS user interface is a modified Weka Explorer environment that supports the execution of both local and remote data mining tasks. On every computing node, a WSRF-compliant Web Service is used to expose all the data mining algorithms provided by the Weka library. The paper describes the design and the implementation of Weka4WS using a first release of the WSRF library. To evaluate the efficiency of the proposed system, a performance analysis of Weka4WS for executing distributed data mining tasks in different network scenarios is presented.
منابع مشابه
WSRF Services for Composing Distributed Data Mining Applications on Grids: Functionality and Performance
The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid applications. WSRF can be exploited for developing high-level services for distributed data mining applications. This paper describes Weka4WS, a framework that extends the widely-used Weka toolkit for supporting distributed data mining on WSRF-enabled Grids. Weka4WS adopts the WSRF tec...
متن کاملThe Weka4WS framework for distributed data mining in service-oriented Grids
The service oriented architecture (SOA) paradigm can be exploited for the implementation of data and knowledge-based applications in distributed environments. The Web Services Resource Framework (WSRF) has recently emerged as the standard for the implementation of Grid services and applications. WSRF can be exploited for developing high-level services for distributed data mining applications. T...
متن کاملExploiting need of Service-Oriented Framework for Executing Data Mining Services
Weka4WS adopts the emerging Web Services Resource Framework (WSRF) for accessing remote data mining algorithms and managing distributed computations. The Weka4WS user interface is a modified Weka Explorer environment that supports the execution of both local and remote data mining tasks. Workflow environments are widely used in data mining systems to manage data and execution flows associated t...
متن کاملAnalysis and Design of Service-Oriented Framework for Executing Data Mining Services on Grids
Data mining services on grids is the need of today’s era. Workflow environments are widely used in data mining systems to manage data and execution flows associated to complex applications. Weka, one of the most used open-source data mining systems, includes the Knowledge-Flow environment which provides a drag-and-drop inter-face to compose and execute data mining workflows. It allows users to ...
متن کاملMeta-learning in Grid-based Data Mining Systems
The Weka4GML framework has been designed to meet the requirements of distributed data mining. In this paper, we present the Weka4GML architecture based on WSRF technology for developing meta-learning methods to deal with datasets distributed among Data Grid. This framework extends the Weka toolkit to support distributed execution of data mining methods, like meta-learning. The architecture and ...
متن کامل